Let's Hear It for Audio Mining
Author
Abstract
The Web, databases, and other digitized information storehouses contain a growing volume of audio content. Sources include newscasts, sporting events, telephone conversations, recordings of meetings, Webcasts, documentary archives such as the Visual History Foundation's interviews with Holocaust survivors (http://www.vhf.org), and media files in libraries. Users want to make the most of this material by searching and indexing the digitized audio content. In the past, companies had to create and manually analyze written transcripts of audio content because using computers to recognize, interpret, and analyze digitized speech was difficult. However, the development of faster microprocessors, larger storage capacities, and better speech-recognition algorithms has made audio mining easier. Now, the technology is on the verge of becoming a powerful tool that could help many organizations. For example, companies could use audio mining to analyze customer-service and help-desk conversations or even voice mail. Law enforcement and intelligence organizations could use the technology to analyze intercepted phone conversations. Public relations firms could use it to analyze news broadcasts to find coverage of clients. Broadcast companies like CNN and Radio Free Asia are already using audio mining to quickly retrieve important background information from previous broadcasts when new stories break. And a US prison is using ScanSoft's audio mining product to analyze recordings of prisoners' phone calls to identify illegal activity. Several companies such as BBN Technologies (http://www.bbn.com), Fast-Talk Communications (http://www.fast-talk.com), IBM, and ScanSoft have released audio mining software, and industry observers expect the number of products to increase during the next few years.
However, audio mining's accuracy levels are still relatively low, and some products are expensive, with some high-end software packages costing more than $100,000 for a full-scale deployment. Audio mining, also called audio searching, takes a text-based query and locates the search term or phrase in an audio file. This helps users by, for example, letting them quickly get to specific places in a recorded conversation or determine when a company is mentioned in a newscast. Audio indexing uses speech recognition to analyze an entire file and produce a searchable index of content-bearing words and their locations. This is critical because audio content is in a binary format that is otherwise not readily searchable, explained Robert Weideman, ScanSoft's chief marketing officer. Indexing audio content thus enables searching, said Jeff Karnes, a group product manager for Virage, an audio mining vendor. There are two main approaches to audio …
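As a rough illustration of the indexing idea described above (content-bearing words mapped to their locations in a recording), the following minimal Python sketch builds a small searchable index. It is not taken from any of the products mentioned; the recognizer output format (word, start time in seconds), the stop-word list, and the sample transcript are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical stop-word list; a real system would use a much larger one.
STOP_WORDS = {"a", "an", "the", "of", "to", "in", "and", "is", "on"}


def build_index(recognized_words):
    """Map each content-bearing word to the times (in seconds) where it occurs.

    `recognized_words` is an assumed recognizer output:
    an iterable of (word, start_seconds) pairs.
    """
    index = defaultdict(list)
    for word, start in recognized_words:
        token = word.lower()
        if token not in STOP_WORDS:  # keep only content-bearing words
            index[token].append(start)
    return dict(index)


def search(index, term):
    """Return the list of start times for a single-word text query."""
    return index.get(term.lower(), [])


# Made-up recognizer output, for illustration only.
transcript = [
    ("The", 0.0), ("merger", 0.4), ("of", 0.9), ("Acme", 1.1),
    ("and", 1.6), ("Globex", 1.8), ("Acme", 12.5), ("shares", 13.0),
    ("rose", 13.4),
]

index = build_index(transcript)
print(search(index, "Acme"))
```

A query for a company name returns the offsets at which it was recognized, which is what lets a user jump directly to those points in the recording. Recognition errors would show up here as missing or spurious index entries.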
Journal title:
- IEEE Computer
Volume 35, Issue
Pages -
Publication date: 2002